<400> 1

atagtgctac aataaaggat ggcactgaac taccgaacct aacgcttgtg gacattgtct 60 

tgggaaattt ggacagtgga cgaatcaaga acaccattaa ttaaattggt gaagttcgat 120 

tgttgatcac cctttttggg taaacaataa ctaaggaatg gcaatatatt tgcttggaaa 180 

cgaatttgat ttaaatagca ttgaatgcaa aaaaatttaa aaggaggaga caaattccac 240 

ccttattaga agggaagttt ccattgtcat gatattatga aaattaataa gaaattatta 300 

atg gct gca ctt gca ggt gca att gta gta aca ggt gga gta gga tct 348
Met Ala Ala Leu Ala Gly Ala Ile Val Val Thr Gly Gly Val Gly Ser
1 5 10 15

tac gca gct gat gaa cct att gat ctt gaa aaa ctt gaa gaa aaa agg 396
Tyr Ala Ala Asp Glu Pro Ile Asp Leu Glu Lys Leu Glu Glu Lys Arg
20 25 30

gat aaa gaa aat gta gga aat tta cca aaa ttc gat aat gaa gtt aaa 444 
Asp Lys Glu Asn Val Gly Asn Leu Pro Lys Phe Asp Asn Glu Val Lys 
35 40 45 

gat ggt tca gaa aat cca atg gct aaa tat cca gat ttc gat gat gaa 492
Asp Gly Ser Glu Asn Pro Met Ala Lys Tyr Pro Asp Phe Asp Asp Glu 
50 55 60 

gcc agt aca aga ttt gaa aca gaa aac aat gaa ttt gaa gaa aaa aaa 540
Ala Ser Thr Arg Phe Glu Thr Glu Asn Asn Glu Phe Glu Glu Lys Lys 
65 70 75 80 

gtt gtt tct gat aac ttt ttt gat caa tca gaa cat ccg ttt gta gaa 588
Val Val Ser Asp Asn Phe Phe Asp Gln Ser Glu His Pro Phe Val Glu

85 90 95 

aat aaa gaa gaa aca cca gaa aca cca gaa act gat tca gaa gaa gaa 636
Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser Glu Glu Glu 

100 105 110 

gta aca atc aaa gct aac cta atc ttt gca aat gga agc aca caa act 684
Val Thr Ile Lys Ala Asn Leu Ile Phe Ala Asn Gly Ser Thr Gln Thr
115 120 125 

gca gaa ttc aaa gga aca ttt gaa aaa gca aca tca gaa gct tat gcg 732
Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu Ala Tyr Ala 
130 135 140 

tat gca gat act ttg aag aaa gac aat gga gaa tat act gta gat gtt 780 
Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val 
145 150 155 160 

gca gat aaa ggt tat act tta aat att aaa ttt gct gga aaa gaa aaa 828
Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly Lys Glu Lys

165 170 175 

aca cca gaa gaa cca aaa gaa gaa gtt act att aaa gca aac tta atc 876
Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala Asn Leu Ile
180 185 190

tat gca gat gga aaa aca caa aca gca gaa ttc aaa gga aca ttt gaa 924
Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Glu
195 200 205

gaa gca aca gca gaa gca tac aga tat gca gat gca tta aag aag gac 972
Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu Lys Lys Asp
210 215 220

aat gga gaa tat aca gta gac gtt gca gat aaa ggt tat act tta aat 1020
Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn
225 230 235 240

att aaa ttt gct gga aaa gaa aaa aca cca gaa gaa cca aaa gaa gaa 1068
Ile Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu
245 250 255

gtt act att aaa gca aac tta atc tat gca gat gga aaa aca caa aca 1116
Val Thr Ile Lys Ala Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr
260 265 270

gca gaa ttc aaa gga aca ttt gaa gaa gca aca gca gaa gca tac aga 1164
Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg
275 280 285

tat gct gac tta tta gca aaa gaa aat ggt aaa tat aca gta gac gtt 1212
Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr Val Asp Val
290 295 300

gca gat aaa ggt tat act tta aat att aaa ttt gct gga aaa gaa aaa 1260
Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly Lys Glu Lys
305 310 315 320

aca cca gaa gaa cca aaa gaa gaa gtt act att aaa gca aac tta atc 1308
Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala Asn Leu Ile
325 330 335

tat gca gat gga aaa act caa aca gca gag ttc aaa gga aca ttt gca 1356
Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Ala
340 345 350

gaa gca aca gca gaa gca tac aga tac gct gac tta tta gca aaa gaa 1404
Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu
355 360 365

aat ggt aaa tat aca gca gac tta gaa gat ggt gga tac act att aat 1452
Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr Thr Ile Asn
370 375 380

att aga ttt gca ggt aag aaa gtt gac gaa aaa cca gaa gaa aaa gaa 1500
Ile Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu Glu Lys Glu
385 390 395 400

caa gta aca att aaa gaa aat ata tat ttt gaa gat gga aca gta caa 1548
Gln Val Thr Ile Lys Glu Asn Ile Tyr Phe Glu Asp Gly Thr Val Gln
405 410 415

act gca aca ttt aaa gga aca ttt gca gaa gcg aca gca gaa gca tac 1596
Thr Ala Thr Phe Lys Gly Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr
420 425 430

aga tat gca gat ttg tta tca aaa gaa cat ggt aaa tac aca gca gac 1644
Arg Tyr Ala Asp Leu Leu Ser Lys Glu His Gly Lys Tyr Thr Ala Asp
435 440 445

ttg gaa gat ggt gga tac act atc aac att aga ttt gct gga aaa gaa 1692
Leu Glu Asp Gly Gly Tyr Thr Ile Asn Ile Arg Phe Ala Gly Lys Glu
450 455 460

gaa cca gaa gaa aca cca gaa aaa cca gaa gta caa gac gga tat gca 1740
Glu Pro Glu Glu Thr Pro Glu Lys Pro Glu Val Gln Asp Gly Tyr Ala
465 470 475 480

tca tac gaa gaa gct gaa gca gca gct aaa gaa gct ttg aaa aat gat 1788
Ser Tyr Glu Glu Ala Glu Ala Ala Ala Lys Glu Ala Leu Lys Asn Asp
485 490 495

gat gta aat aaa tca tat act att aga caa ggt gca gat gga aga tat 1836
Asp Val Asn Lys Ser Tyr Thr Ile Arg Gln Gly Ala Asp Gly Arg Tyr
500 505 510

tac tat gta tta tca cca gta gaa gct gaa gaa gaa aaa cca gaa gca 1884
Tyr Tyr Val Leu Ser Pro Val Glu Ala Glu Glu Glu Lys Pro Glu Ala
515 520 525

caa aat gga tat gca aca tac gaa gaa gca gaa gca gca gct aag aaa 1932
Gln Asn Gly Tyr Ala Thr Tyr Glu Glu Ala Glu Ala Ala Ala Lys Lys
530 535 540

gct ttg gaa aat gat cca atc aat aaa tct tac agc ata aga caa ggt 1980
Ala Leu Glu Asn Asp Pro Ile Asn Lys Ser Tyr Ser Ile Arg Gln Gly
545 550 555 560

gca gat gga aga tac tac tat gta tta tca cca gta gaa gct gaa act 2028
Ala Asp Gly Arg Tyr Tyr Tyr Val Leu Ser Pro Val Glu Ala Glu Thr
565 570 575

cct gaa aaa cca gta gaa cca tca gaa cca tca aca cca gat gta cca 2076
Pro Glu Lys Pro Val Glu Pro Ser Glu Pro Ser Thr Pro Asp Val Pro
580 585 590

tca aac cca tca aat cca tca aca cca gat gtt cca tca act cct gat 2124
Ser Asn Pro Ser Asn Pro Ser Thr Pro Asp Val Pro Ser Thr Pro Asp
595 600 605

gta cca tca aat cct tca act cca gaa gtt cct tca aac cca tca act 2172
Val Pro Ser Asn Pro Ser Thr Pro Glu Val Pro Ser Asn Pro Ser Thr
610 615 620

cct ggt aat gaa gaa aaa cct ggt aac gaa caa aaa cct ggt aac gaa 2220
Pro Gly Asn Glu Glu Lys Pro Gly Asn Glu Gln Lys Pro Gly Asn Glu
625 630 635 640

caa aaa cct ggt aac gaa caa aaa cct ggt aat gaa caa aaa cct ggt 2268
Gln Lys Pro Gly Asn Glu Gln Lys Pro Gly Asn Glu Gln Lys Pro Gly
645 650 655

aat gaa caa aaa cca gac caa cct tca aaa cca gaa aaa gaa gaa aat 2316
Asn Glu Gln Lys Pro Asp Gln Pro Ser Lys Pro Glu Lys Glu Glu Asn
660 665 670

ggt aag ggt gga gta gat tct cca aag aaa aaa gaa aaa gct gca tta 2364
Gly Lys Gly Gly Val Asp Ser Pro Lys Lys Lys Glu Lys Ala Ala Leu
675 680 685

cca aaa gct ggt agc gaa gct gaa atc tta aca tta gca gca gct tca 2412
Pro Lys Ala Gly Ser Glu Ala Glu Ile Leu Thr Leu Ala Ala Ala Ser
690 695 700

tta tca agc gtt gca ggt gct ttc att tca ctt aaa aaa cgt aaa taa 2460
Leu Ser Ser Val Ala Gly Ala Phe Ile Ser Leu Lys Lys Arg Lys
705 710 715

ttaatcttag ataaaaaata gattaatcta aaaaaatggg acttttataa gtcccatttt 2520 

tgattgcaat gaaactgata caaaaaatgt atcagttttt tcatttacgc ttattttcct 2580

gtgagtatgt ccaagtttgt cgtataatca tctatcaaaa gcctggcgag tttcgctttt 2640

tggattttgt cttcgaa 2657 



<210> 2 
<211> 719 
<212> PRT 

<213> Peptostreptococcus magnus 
<400> 2 

Met Ala Ala Leu Ala Gly Ala Ile Val Val Thr Gly Gly Val Gly Ser
1 5 10 15

Tyr Ala Ala Asp Glu Pro Ile Asp Leu Glu Lys Leu Glu Glu Lys Arg
20 25 30

Asp Lys Glu Asn Val Gly Asn Leu Pro Lys Phe Asp Asn Glu Val Lys
35 40 45

Asp Gly Ser Glu Asn Pro Met Ala Lys Tyr Pro Asp Phe Asp Asp Glu
50 55 60

Ala Ser Thr Arg Phe Glu Thr Glu Asn Asn Glu Phe Glu Glu Lys Lys
65 70 75 80

Val Val Ser Asp Asn Phe Phe Asp Gln Ser Glu His Pro Phe Val Glu
85 90 95

Asn Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser Glu Glu Glu
100 105 110

Val Thr Ile Lys Ala Asn Leu Ile Phe Ala Asn Gly Ser Thr Gln Thr
115 120 125

Ala Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu Ala Tyr Ala
130 135 140

Tyr Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val
145 150 155 160

Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly Lys Glu Lys
165 170 175

Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala Asn Leu Ile
180 185 190

Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Glu
195 200 205

Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu Lys Lys Asp
210 215 220

Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr Thr Leu Asn
225 230 235 240

Ile Lys Phe Ala Gly Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu
245 250 255

Val Thr Ile Lys Ala Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr
260 265 270

Ala Glu Phe Lys Gly Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg
275 280 285

Tyr Ala Asp Leu Leu Ala Lys Glu Asn Gly Lys Tyr Thr Val Asp Val
290 295 300

Ala Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly Lys Glu Lys
305 310 315 320

Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala Asn Leu Ile
325 330 335

Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly Thr Phe Ala
340 345 350

Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu Ala Lys Glu
355 360 365

Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr Thr Ile Asn
370 375 380

Ile Arg Phe Ala Gly Lys Lys Val Asp Glu Lys Pro Glu Glu Lys Glu
385 390 395 400

Gln Val Thr Ile Lys Glu Asn Ile Tyr Phe Glu Asp Gly Thr Val Gln
405 410 415

Thr Ala Thr Phe Lys Gly Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr
420 425 430

Arg Tyr Ala Asp Leu Leu Ser Lys Glu His Gly Lys Tyr Thr Ala Asp
435 440 445

Leu Glu Asp Gly Gly Tyr Thr Ile Asn Ile Arg Phe Ala Gly Lys Glu
450 455 460

Glu Pro Glu Glu Thr Pro Glu Lys Pro Glu Val Gln Asp Gly Tyr Ala
465 470 475 480

Ser Tyr Glu Glu Ala Glu Ala Ala Ala Lys Glu Ala Leu Lys Asn Asp
485 490 495

Asp Val Asn Lys Ser Tyr Thr Ile Arg Gln Gly Ala Asp Gly Arg Tyr
500 505 510

Tyr Tyr Val Leu Ser Pro Val Glu Ala Glu Glu Glu Lys Pro Glu Ala
515 520 525

Gln Asn Gly Tyr Ala Thr Tyr Glu Glu Ala Glu Ala Ala Ala Lys Lys
530 535 540

Ala Leu Glu Asn Asp Pro Ile Asn Lys Ser Tyr Ser Ile Arg Gln Gly
545 550 555 560

Ala Asp Gly Arg Tyr Tyr Tyr Val Leu Ser Pro Val Glu Ala Glu Thr
565 570 575

Pro Glu Lys Pro Val Glu Pro Ser Glu Pro Ser Thr Pro Asp Val Pro
580 585 590

Ser Asn Pro Ser Asn Pro Ser Thr Pro Asp Val Pro Ser Thr Pro Asp
595 600 605

Val Pro Ser Asn Pro Ser Thr Pro Glu Val Pro Ser Asn Pro Ser Thr
610 615 620

Pro Gly Asn Glu Glu Lys Pro Gly Asn Glu Gln Lys Pro Gly Asn Glu
625 630 635 640

Gln Lys Pro Gly Asn Glu Gln Lys Pro Gly Asn Glu Gln Lys Pro Gly
645 650 655

Asn Glu Gln Lys Pro Asp Gln Pro Ser Lys Pro Glu Lys Glu Glu Asn
660 665 670

Gly Lys Gly Gly Val Asp Ser Pro Lys Lys Lys Glu Lys Ala Ala Leu
675 680 685

Pro Lys Ala Gly Ser Glu Ala Glu Ile Leu Thr Leu Ala Ala Ala Ser
690 695 700

Leu Ser Ser Val Ala Gly Ala Phe Ile Ser Leu Lys Lys Arg Lys
705 710 715



<210> 3 

<211> 228 

<212> DNA 

<213> Peptostreptococcus magnus 



<220> 

<221> CDS 

<222> (1)..(228)

<223> PpL1

<400> 3

aaa gaa gaa aca cca gaa aca cca gaa act gat tca gaa gaa gaa gta 48
Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser Glu Glu Glu Val
1 5 10 15

aca atc aaa gct aac cta atc ttt gca aat gga agc aca caa act gca 96
Thr Ile Lys Ala Asn Leu Ile Phe Ala Asn Gly Ser Thr Gln Thr Ala
20 25 30

gaa ttc aaa gga aca ttt gaa aaa gca aca tca gaa gct tat gcg tat 144
Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu Ala Tyr Ala Tyr
35 40 45

gca gat act ttg aag aaa gac aat gga gaa tat act gta gat gtt gca 192
Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala
50 55 60

gat aaa ggt tat act tta aat att aaa ttt gct gga 228
Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly
65 70 75



<210> 4 
<211> 76 
<212> PRT 

<213> Peptostreptococcus magnus 
<400> 4 

Lys Glu Glu Thr Pro Glu Thr Pro Glu Thr Asp Ser Glu Glu Glu Val
1 5 10 15

Thr Ile Lys Ala Asn Leu Ile Phe Ala Asn Gly Ser Thr Gln Thr Ala
20 25 30

Glu Phe Lys Gly Thr Phe Glu Lys Ala Thr Ser Glu Ala Tyr Ala Tyr
35 40 45

Ala Asp Thr Leu Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala
50 55 60

Asp Lys Gly Tyr Thr Leu Asn Ile Lys Phe Ala Gly
65 70 75



<210> 5 

<211> 216 

<212> DNA 

<213> Peptostreptococcus magnus 



<220> 

<221> CDS 

<222> (1)..(216)

<223> PpL2

<400> 5

aaa gaa aaa aca cca gaa gaa cca aaa gaa gaa gtt act att aaa gca 48
Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
1 5 10 15

aac tta atc tat gca gat gga aaa aca caa aca gca gaa ttc aaa gga 96
Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
20 25 30

aca ttt gaa gaa gca aca gca gaa gca tac aga tat gca gat gca tta 144
Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu
35 40 45

aag aag gac aat gga gaa tat aca gta gac gtt gca gat aaa ggt tat 192
Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr
50 55 60

act tta aat att aaa ttt gct gga 216
Thr Leu Asn Ile Lys Phe Ala Gly
65 70



<210> 6 
<211> 72 
<212> PRT 

<213> Peptostreptococcus magnus 
<400> 6 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
1 5 10 15

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
20 25 30

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Ala Leu
35 40 45

Lys Lys Asp Asn Gly Glu Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr
50 55 60

Thr Leu Asn Ile Lys Phe Ala Gly
65 70



<210> 7 

<211> 216 

<212> DNA 

<213> Peptostreptococcus magnus

<220>

<221> CDS

<222> (1)..(216)

<223> PpL3 

<400> 7 

aaa gaa aaa aca cca gaa gaa cca aaa gaa gaa gtt act att aaa gca 48
Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
1 5 10 15

aac tta atc tat gca gat gga aaa aca caa aca gca gaa ttc aaa gga 96
Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
20 25 30

aca ttt gaa gaa gca aca gca gaa gca tac aga tat gct gac tta tta 144
Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu
35 40 45

gca aaa gaa aat ggt aaa tat aca gta gac gtt gca gat aaa ggt tat 192
Ala Lys Glu Asn Gly Lys Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr
50 55 60

act tta aat att aaa ttt gct gga 216
Thr Leu Asn Ile Lys Phe Ala Gly
65 70



<210> 8 
<211> 72 
<212> PRT 

<213> Peptostreptococcus magnus 
<400> 8 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
1 5 10 15

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
20 25 30

Thr Phe Glu Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu
35 40 45

Ala Lys Glu Asn Gly Lys Tyr Thr Val Asp Val Ala Asp Lys Gly Tyr
50 55 60

Thr Leu Asn Ile Lys Phe Ala Gly
65 70



<210> 9 
<211> 216 
<212> DNA 



<213> Peptostreptococcus magnus 



<220> 

<221> CDS 

<222> (1)..(216)

<223> PpL4

<400> 9

aaa gaa aaa aca cca gaa gaa cca aaa gaa gaa gtt act att aaa gca 48
Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
1 5 10 15

aac tta atc tat gca gat gga aaa act caa aca gca gag ttc aaa gga 96
Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
20 25 30

aca ttt gca gaa gca aca gca gaa gca tac aga tac gct gac tta tta 144
Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu
35 40 45

gca aaa gaa aat ggt aaa tat aca gca gac tta gaa gat ggt gga tac 192
Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr
50 55 60

act att aat att aga ttt gca ggt 216
Thr Ile Asn Ile Arg Phe Ala Gly
65 70



<210> 10 
<211> 72 
<212> PRT 

<213> Peptostreptococcus magnus 
<400> 10 

Lys Glu Lys Thr Pro Glu Glu Pro Lys Glu Glu Val Thr Ile Lys Ala
1 5 10 15

Asn Leu Ile Tyr Ala Asp Gly Lys Thr Gln Thr Ala Glu Phe Lys Gly
20 25 30

Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu Leu
35 40 45

Ala Lys Glu Asn Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly Tyr
50 55 60

Thr Ile Asn Ile Arg Phe Ala Gly
65 70

<210> 11

<211> 219 

<212> DNA 

<213> Peptostreptococcus magnus 



<220> 

<221> CDS 

<222> (1)..(219) 

<223> PpL5 

<400> 11 

aag aaa gtt gac gaa aaa cca gaa gaa aaa gaa caa gta aca att aaa 48
Lys Lys Val Asp Glu Lys Pro Glu Glu Lys Glu Gln Val Thr Ile Lys
1 5 10 15

gaa aat ata tat ttt gaa gat gga aca gta caa act gca aca ttt aaa 96
Glu Asn Ile Tyr Phe Glu Asp Gly Thr Val Gln Thr Ala Thr Phe Lys
20 25 30

gga aca ttt gca gaa gcg aca gca gaa gca tac aga tat gca gat ttg 144
Gly Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu
35 40 45

tta tca aaa gaa cat ggt aaa tac aca gca gac ttg gaa gat ggt gga 192
Leu Ser Lys Glu His Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly
50 55 60

tac act atc aac att aga ttt gct gga 219
Tyr Thr Ile Asn Ile Arg Phe Ala Gly
65 70



<210> 12 

<211> 73 

<212> PRT 

<213> Peptostreptococcus magnus 

<400> 12 

Lys Lys Val Asp Glu Lys Pro Glu Glu Lys Glu Gln Val Thr Ile Lys
1 5 10 15

Glu Asn Ile Tyr Phe Glu Asp Gly Thr Val Gln Thr Ala Thr Phe Lys
20 25 30

Gly Thr Phe Ala Glu Ala Thr Ala Glu Ala Tyr Arg Tyr Ala Asp Leu
35 40 45

Leu Ser Lys Glu His Gly Lys Tyr Thr Ala Asp Leu Glu Asp Gly Gly
50 55 60

Tyr Thr Ile Asn Ile Arg Phe Ala Gly
65 70



<210> 13 

<211> 1269 

<212> DNA 

<213> Sindbis virus 



<220> 

<221> misc_feature
<222> (1)..(1269)
<223> E2 protein gene from strain AR339

<220>
<221> CDS
<222> (1)..(1269)
<223> E2 glycoprotein

<220>
<221> variation
<222> (3)..(3)
<223> a in AR339; c in HRsp

<220>
<221> variation
<222> (8)..(8)
<223> c in AR339; t in HRsp

<220>
<221> variation
<222> (68)..(68)
<223> a in AR339; t in HRsp

<220>
<221> variation
<222> (208)..(208)
<223> g in AR339; a in HRsp

<220>
<221> variation
<222> (514)..(514)

<223> g in AR339; a in HRsp 



<400> 13 

aga gtc act gac gac ttt acc ctg acc agc ccc tac ttg ggc aca tgc 48
Arg Val Thr Asp Asp Phe Thr Leu Thr Ser Pro Tyr Leu Gly Thr Cys
1 5 10 15

tcg tac tgc cac cat act gaa ccg tgc ttc agc cct gtt aag atc gag 96
Ser Tyr Cys His His Thr Glu Pro Cys Phe Ser Pro Val Lys Ile Glu
20 25 30

cag gtc tgg gac gaa gcg gac gat aac acc ata cgc ata cag act tcc 144
Gln Val Trp Asp Glu Ala Asp Asp Asn Thr Ile Arg Ile Gln Thr Ser
35 40 45

gcc cag ttt gga tac gac caa agc gga gca gca agc gca aac aag tac 192
Ala Gln Phe Gly Tyr Asp Gln Ser Gly Ala Ala Ser Ala Asn Lys Tyr
50 55 60

cgc tac atg tcg ctt gag cag gat cac acc gtt aaa gaa ggc acc atg 240
Arg Tyr Met Ser Leu Glu Gln Asp His Thr Val Lys Glu Gly Thr Met
65 70 75 80

gat gac atc aag att agc acc tca gga ccg tgt aga agg ctt agc tac 288
Asp Asp Ile Lys Ile Ser Thr Ser Gly Pro Cys Arg Arg Leu Ser Tyr
85 90 95

aaa gga tac ttt ctc ctc gca aaa tgc cct cca ggg gac agc gta acg 336
Lys Gly Tyr Phe Leu Leu Ala Lys Cys Pro Pro Gly Asp Ser Val Thr
100 105 110

gtt agc ata gtg agt agc aac tca gca acg tca tgt aca ctg gcc cgc 384
Val Ser Ile Val Ser Ser Asn Ser Ala Thr Ser Cys Thr Leu Ala Arg
115 120 125

aag ata aaa cca aaa ttc gtg gga cgg gaa aaa tat gat cta cct ccc 432
Lys Ile Lys Pro Lys Phe Val Gly Arg Glu Lys Tyr Asp Leu Pro Pro
130 135 140

gtt cac ggt aaa aaa att cct tgc aca gtg tac gac cgt ctg aaa gaa 480
Val His Gly Lys Lys Ile Pro Cys Thr Val Tyr Asp Arg Leu Lys Glu
145 150 155 160

aca act gca ggc tac atc act atg cac agg ccg gga ccg cac gct tat 528
Thr Thr Ala Gly Tyr Ile Thr Met His Arg Pro Gly Pro His Ala Tyr
165 170 175

aca tcc tac ctg gaa gaa tca tca ggg aaa gtt tac gca aag ccg cca 576
Thr Ser Tyr Leu Glu Glu Ser Ser Gly Lys Val Tyr Ala Lys Pro Pro
180 185 190

tct ggg aag aac att acg tat gag tgc aag tgc ggc gac tac aag acc 624
Ser Gly Lys Asn Ile Thr Tyr Glu Cys Lys Cys Gly Asp Tyr Lys Thr
195 200 205

gga acc gtt tcg acc cgc acc gaa atc act ggt tgc acc gcc atc aag 672
Gly Thr Val Ser Thr Arg Thr Glu Ile Thr Gly Cys Thr Ala Ile Lys
210 215 220

cag tgc gtc gcc tat aag agc gac caa acg aag tgg gtc ttc aac tca 720
Gln Cys Val Ala Tyr Lys Ser Asp Gln Thr Lys Trp Val Phe Asn Ser
225 230 235 240

ccg gac ttg atc aga cat gac gac cac acg gcc caa ggg aaa ttg cat 768
Pro Asp Leu Ile Arg His Asp Asp His Thr Ala Gln Gly Lys Leu His
245 250 255

ttg cct ttc aag ttg atc ccg agt acc tgc atg gtc cct gtt gcc cac 816
Leu Pro Phe Lys Leu Ile Pro Ser Thr Cys Met Val Pro Val Ala His
260 265 270

gcg ccg aat gta ata cat ggc ttt aaa cac atc agc ctc caa tta gat 864
Ala Pro Asn Val Ile His Gly Phe Lys His Ile Ser Leu Gln Leu Asp
275 280 285

aca gac cac ttg aca ttg ctc acc acc agg aga cta ggg gca aac ccg 912
Thr Asp His Leu Thr Leu Leu Thr Thr Arg Arg Leu Gly Ala Asn Pro
290 295 300

gaa cca acc act gaa tgg atc gtc gga aag acg gtc aga aac ttc acc 960
Glu Pro Thr Thr Glu Trp Ile Val Gly Lys Thr Val Arg Asn Phe Thr
305 310 315 320

gtc gac cga gat ggc ctg gaa tac ata tgg gga aat cat gag cca gtg 1008
Val Asp Arg Asp Gly Leu Glu Tyr Ile Trp Gly Asn His Glu Pro Val
325 330 335

agg gtc tat gcc caa gag tca gca cca gga gac cct cac gga tgg cca 1056
Arg Val Tyr Ala Gln Glu Ser Ala Pro Gly Asp Pro His Gly Trp Pro
340 345 350

cac gaa ata gta cag cat tac tac cat cgc cat cct gtg tac acc atc 1104
His Glu Ile Val Gln His Tyr Tyr His Arg His Pro Val Tyr Thr Ile
355 360 365

tta gcc gtc gca tca gct acc gtg gcg atg atg att ggc gta acc gtt 1152
Leu Ala Val Ala Ser Ala Thr Val Ala Met Met Ile Gly Val Thr Val
370 375 380

gca gtg tta tgt gcc tgt aaa gcg cgc cgt gag tgc ctg acg cca tac 1200
Ala Val Leu Cys Ala Cys Lys Ala Arg Arg Glu Cys Leu Thr Pro Tyr
385 390 395 400

gcc ctg gcc cca aac gcc gta atc cca act tcg ctg gca ctc ttg tgc 1248
Ala Leu Ala Pro Asn Ala Val Ile Pro Thr Ser Leu Ala Leu Leu Cys
405 410 415

tgc gtt agg tcg gcc aat gct 1269
Cys Val Arg Ser Ala Asn Ala
420



<210> 14 
<211> 423 
<212> PRT 

<213> Sindbis virus 
<400> 14 

Arg Val Thr Asp Asp Phe Thr Leu Thr Ser Pro Tyr Leu Gly Thr Cys
1 5 10 15

Ser Tyr Cys His His Thr Glu Pro Cys Phe Ser Pro Val Lys Ile Glu
20 25 30

Gln Val Trp Asp Glu Ala Asp Asp Asn Thr Ile Arg Ile Gln Thr Ser
35 40 45

Ala Gln Phe Gly Tyr Asp Gln Ser Gly Ala Ala Ser Ala Asn Lys Tyr
50 55 60

Arg Tyr Met Ser Leu Glu Gln Asp His Thr Val Lys Glu Gly Thr Met
65 70 75 80

Asp Asp Ile Lys Ile Ser Thr Ser Gly Pro Cys Arg Arg Leu Ser Tyr
85 90 95

Lys Gly Tyr Phe Leu Leu Ala Lys Cys Pro Pro Gly Asp Ser Val Thr
100 105 110

Val Ser Ile Val Ser Ser Asn Ser Ala Thr Ser Cys Thr Leu Ala Arg
115 120 125

Lys Ile Lys Pro Lys Phe Val Gly Arg Glu Lys Tyr Asp Leu Pro Pro
130 135 140

Val His Gly Lys Lys Ile Pro Cys Thr Val Tyr Asp Arg Leu Lys Glu
145 150 155 160

Thr Thr Ala Gly Tyr Ile Thr Met His Arg Pro Gly Pro His Ala Tyr
165 170 175

Thr Ser Tyr Leu Glu Glu Ser Ser Gly Lys Val Tyr Ala Lys Pro Pro
180 185 190

Ser Gly Lys Asn Ile Thr Tyr Glu Cys Lys Cys Gly Asp Tyr Lys Thr
195 200 205

Gly Thr Val Ser Thr Arg Thr Glu Ile Thr Gly Cys Thr Ala Ile Lys
210 215 220

Gln Cys Val Ala Tyr Lys Ser Asp Gln Thr Lys Trp Val Phe Asn Ser
225 230 235 240

Pro Asp Leu Ile Arg His Asp Asp His Thr Ala Gln Gly Lys Leu His
245 250 255

Leu Pro Phe Lys Leu Ile Pro Ser Thr Cys Met Val Pro Val Ala His
260 265 270

Ala Pro Asn Val Ile His Gly Phe Lys His Ile Ser Leu Gln Leu Asp
275 280 285

Thr Asp His Leu Thr Leu Leu Thr Thr Arg Arg Leu Gly Ala Asn Pro
290 295 300

Glu Pro Thr Thr Glu Trp Ile Val Gly Lys Thr Val Arg Asn Phe Thr
305 310 315 320

Val Asp Arg Asp Gly Leu Glu Tyr Ile Trp Gly Asn His Glu Pro Val
325 330 335

Arg Val Tyr Ala Gln Glu Ser Ala Pro Gly Asp Pro His Gly Trp Pro
340 345 350

His Glu Ile Val Gln His Tyr Tyr His Arg His Pro Val Tyr Thr Ile
355 360 365

Leu Ala Val Ala Ser Ala Thr Val Ala Met Met Ile Gly Val Thr Val
370 375 380

Ala Val Leu Cys Ala Cys Lys Ala Arg Arg Glu Cys Leu Thr Pro Tyr
385 390 395 400

Ala Leu Ala Pro Asn Ala Val Ile Pro Thr Ser Leu Ala Leu Leu Cys
405 410 415

Cys Val Arg Ser Ala Asn Ala
420



<210> 15 

<211> 11703 

<212> DNA 

<213> Sindbis virus 



<220> 

<221> misc_feature
<222> (1)..(11703)
<223> Sindbis virus HRsp and wild-type strains complete genome

<220>
<221> CDS
<222> (8439)..(8630)
<223> E3 protein

<220>
<221> CDS
<222> (8631)..(9899)
<223> E2 protein

<400> 15 

attgacggcg tagtacacac tattgaatca aacagccgac caattgcact accatcacaa 60 

tggagaagcc agtagtaaac gtagacgtag acccccagag tccgtttgtc gtgcaactgc 120

aaaaaagctt cccgcaattt gaggtagtag cacagcaggt cactccaaat gaccatgcta 180 

atgccagagc attttcgcat ctggccagta aactaatcga gctggaggtt cctaccacag 240 

cgacgatctt ggacataggc agcgcaccgg ctcgtagaat gttttccgag caccagtatc 300

attgtgtctg ccccatgcgt agtccagaag acccggaccg catgatgaaa tacgccagta 360 

aactggcgga aaaagcgtgc aagattacaa acaagaactt gcatgagaag attaaggatc 420 

tccggaccgt acttgatacg ccggatgctg aaacaccatc gctctgcttt cacaacgatg 480 

ttacctgcaa catgcgtgcc gaatattccg tcatgcagga cgtgtatatc aacgctcccg 540 

gaactatcta tcatcaggct atgaaaggcg tgcggaccct gtactggatt ggcttcgaca 600 

ccacccagtt catgttctcg gctatggcag gttcgtaccc tgcgtacaac accaactggg 660 

ccgacgagaa agtccttgaa gcgcgtaaca tcggactttg cagcacaaag ctgagtgaag 720 

gtaggacagg aaaattgtcg ataatgagga agaaggagtt gaagcccggg tcgcgggttt 780 

atttctccgt aggatcgaca ctttatccag aacacagagc cagcttgcag agctggcatc 840 

ttccatcggt gttccacttg aatggaaagc agtcgtacac ttgccgctgt gatacagtgg 900 

tgagttgcga aggctacgta gtgaagaaaa tcaccatcag tcccgggatc acgggagaaa 960 

ccgtgggata cgcggttaca cacaatagcg agggcttctt gctatgcaaa gttactgaca 1020

cagtaaaagg agaacgggta tcgttccctg tgtgcacgta catcccggcc accatatgcg 1080 

atcagatgac tggtataatg gccacggata tatcacctga cgatgcacaa aaacttctgg 1140 

ttgggctcaa ccagcgaatt gtcattaacg gtaggactaa caggaacacc aacaccatgc 1200 

aaaattacct tctgccgatc atagcacaag ggttcagcaa atgggctaag gagcgcaagg 1260

atgatcttga taacgagaaa atgctgggta ctagagaacg caagcttacg tatggctgct 1320 

tgtgggcgtt tcgcactaag aaagtacatt cgttttatcg cccacctgga acgcagacct 1380

gcgtaaaagt cccagcctct tttagcgctt ttcccatgtc gtccgtatgg acgacctctt 1440 

tgcccatgtc gctgaggcag aaattgaaac tggcattgca accaaagaag gaggaaaaac 1500 

tgctgcaggt ctcggaggaa ttagtcatgg aggccaaggc tgcttttgag gatgctcagg 1560 



aggaagccag agcggagaag ctccgagaag cacttccacc attagtggca gacaaaggca 1620

tcgaggcagc cgcagaagtt gtctgcgaag tggaggggct ccaggcggac atcggagcag 1680 

cattagttga aaccccgcgc ggtcacgtaa ggataatacc tcaagcaaat gaccgtatga 1740 

tcggacagta tatcgttgtc tcgccaaact ctgtgctgaa gaatgccaaa ctcgcaccag 1800 

cgcacccgct agcagatcag gttaagatca taacacactc cggaagatca ggaaggtacg 1860 

cggtcgaacc atacgacgct aaagtactga tgccagcagg aggtgccgta ccatggccag 1920 

aattcctagc actgagtgag agcgccacgt tagtgtacaa cgaaagagag tttgtgaacc 1980 

gcaaactata ccacattgcc atgcatggcc ccgccaagaa tacagaagag gagcagtaca 2040 

aggttacaaa ggcagagctt gcagaaacag agtacgtgtt tgacgtggac aagaagcgtt 2100 

gcgttaagaa ggaagaagcc tcaggtctgg tcctctcggg agaactgacc aaccctccct 2160 

atcatgagct agctctggag ggactgaaga cccgacctgc ggtcccgtac aaggtcgaaa 2220

caataggagt gataggcaca ccggggtcgg gcaagtcagc tattatcaag tcaactgtca 2280 

cggcacgaga tcttgttacc agcggaaaga aagaaaattg tcgcgaaatt gaggccgacg 2340 

tgctaagact gaggggtatg cagattacgt cgaagacagt agattcggtt atgctcaacg 2400 

gatgccacaa agccgtagaa gtgctgtacg ttgacgaagc gttcgcgtgc cacgcaggag 2460 

cactacttgc cttgattgct atcgtcaggc cccgcaagaa ggtagtacta tgcggagacc 2520 

ccatgcaatg cggattcttc aacatgatgc aactaaaggt acatttcaat caccctgaaa 2580 

aagacatatg caccaagaca ttctacaagt atatctcccg gcgttgcaca cagccagtta 2640 

cagctattgt atcgacactg cattacgatg gaaagatgaa aaccacgaac ccgtgcaaga 2700 

agaacattga aatcgatatt acaggggcca caaagccgaa gccaggggat atcatcctga 2760 

catgtttccg cgggtgggtt aagcaattgc aaatcgacta tcccggacat gaagtaatga 2820 

cagccgcggc ctcacaaggg ctaaccagaa aaggagtgta tgccgtccgg caaaaagtca 2880 

atgaaaaccc actgtacgcg atcacatcag agcatgtgaa cgtgttgctc acccgcactg 2940 

aggacaggct agtgtggaaa accttgcagg gcgacccatg gattaagcag cccactaaca 3000

tacctaaagg aaactttcag gctactatag aggactggga agctgaacac aagggaataa 3060 

ttgctgcaat aaacagcccc actccccgtg ccaatccgtt cagctgcaag accaacgttt 3120 

gctgggcgaa agcattggaa ccgatactag ccacggccgg tatcgtactt accggttgcc 3180 

agtggagcga actgttccca cagtttgcgg atgacaaacc acattcggcc atttacgcct 3240 






tagacgtaat ttgcattaag tttttcggca tggacttgac aagcggactg ttttctaaac 3300 

agagcatccc actaacgtac catcccgccg attcagcgag gccggtagct cattgggaca 3360 

acagcccagg aacccgcaag tatgggtacg atcacgccat tgccgccgaa ctctcccgta 3420 

gatttccggt gttccagcta gctgggaagg gcacacaact tgatttgcag acggggagaa 3480 

ccagagttat ctctgcacag cataacctgg tcccggtgaa ccgcaatctt cctcacgcct 3540 

tagtccccga gtacaaggag aagcaacccg gcccggtcaa aaaattcttg aaccagttca 3600

aacaccactc agtacttgtg gtatcagagg aaaaaattga agctccccgt aagagaatcg 3660 

aatggatcgc cccgattggc atagccggtg cagataagaa ctacaacctg gctttcgggt 3720 

ttccgccgca ggcacggtac gacctggtgt tcatcaacat tggaactaaa tacagaaacc 3780 

accactttca gcagtgcgaa gaccatgcgg cgaccttaaa aaccctttcg cgttcggccc 3840

tgaattgcct taacccagga ggcaccctcg tggtgaagtc ctatggctac gccgaccgca 3900 

acagtgagga cgtagtcacc gctcttgcca gaaagtttgt cagggtgtct gcagcgagac 3960

cagattgtgt ctcaagcaat acagaaatgt acctgatttt ccgacaacta gacaacagcc 4020 

gtacacggca attcaccccg caccatctga attgcgtgat ttcgtccgtg tatgagggta 4080 

caagagatgg agttggagcc gcgccgtcat accgcaccaa aagggagaat attgctgact 4140 

gtcaagagga agcagttgtc aacgcagcca atccgctggg tagaccaggc gaaggagtct 4200

gccgtgccat ctataaacgt tggccgacca gttttaccga ttcagccacg gagacaggca 4260

ccgcaagaat gactgtgtgc ctaggaaaga aagtgatcca cgcggtcggc cctgatttcc 4320 

ggaagcaccc agaagcagaa gccttgaaat tgctacaaaa cgcctaccat gcagtggcag 4380

acttagtaaa tgaacataac atcaagtctg tcgccattcc actgctatct acaggcattt 4440 

acgcagccgg aaaagaccgc cttgaagtat cacttaactg cttgacaacc gcgctagaca 4500 

gaactgacgc ggacgtaacc atctattgcc tggataagaa gtggaaggaa agaatcgacg 4560

cggcactcca acttaaggag tctgtaacag agctgaagga tgaagatatg gagatcgacg 4620

atgagttagt atggatccat ccagacagtt gcttgaaggg aagaaaggga ttcagtacta 4680 

caaaaggaaa attgtattcg tacttcgaag gcaccaaatt ccatcaagca gcaaaagaca 4740 

tggcggagat aaaggtcctg ttccctaatg accaggaaag taatgaacaa ctgtgtgcct 4800 

acatattggg tgagaccatg gaagcaatcc gcgaaaagtg cccggtcgac cataacccgt 4860 

cgtctagccc gcccaaaacg ttgccgtgcc tttgcatgta tgccatgacg ccagaaaggg 4920 

tccacagact tagaagcaat aacgtcaaag aagttacagt atgctcctcc accccccttc 4980 
ctaagcacaa aattaagaat gttcagaagg ttcagtgcac gaaagtagtc ctgtttaatc 5040

cgcacactcc cgcattcgtt cccgcccgta agtacataga agtgccagaa cagcctaccg 5100 

ctcctcctgc acaggccgag gaggcccccg aagttgtagc gacaccgtca ccatctacag 5160 

ctgataacac ctcgcttgat gtcacagaca tctcactgga tatggatgac agtagcgaag 5220 

gctcactttt ttcgagcttt agcggatcgg acaactctat tactagtatg gacagttggt 5280 

cgtcaggacc tagttcacta gagatagtag accgaaggca ggtggtggtg gctgacgttc 5340 

atgccgtcca agagcctgcc cctattccac cgccaaggct aaagaagatg gcccgcctgg 5400 

cagcggcaag aaaagagccc actccaccgg caagcaatag ctctgagtcc ctccacctct 5460 

cttttggtgg ggtatccatg tccctcggat caattttcga cggagagacg gcccgccagg 5520 

cagcggtaca acccctggca acaggcccca cggatgtgcc tatgtctttc ggatcgtttt 5580 

ccgacggaga gattgatgag ctgagccgca gagtaactga gtccgaaccc gtcctgtttg 5640 

gatcatttga accgggcgaa gtgaactcaa ttatatcgtc ccgatcagcc gtatcttttc 5700 

cactacgcaa gcagagacgt agacgcagga gcaggaggac tgaatactga ctaaccgggg 5760

taggtgggta catattttcg acggacacag gccctgggca cttgcaaaag aagtccgttc 5820 

tgcagaacca gcttacagaa ccgaccttgg agcgcaatgt cctggaaaga attcatgccc 5880 

cggtgctcga cacgtcgaaa gaggaacaac tcaaactcag gtaccagatg atgcccaccg 5940 

aagccaacaa aagtaggtac cagtctcgta aagtagaaaa tcagaaagcc ataaccactg 6000 

agcgactact gtcaggacta cgactgtata actctgccac agatcagcca gaatgctata 6060 

agatcaccta tccgaaacca ttgtactcca gtagcgtacc ggcgaactac tccgatccac 6120 

agttcgctgt agctgtctgt aacaactatc tgcatgagaa ctatccgaca gtagcatctt 6180 

atcagattac tgacgagtac gatgcttact tggatatggt agacgggaca gtcgcctgcc 6240 

tggatactgc aaccttctgc cccgctaagc ttagaagtta cccgaaaaaa catgagtata 6300 

gagccccgaa tatccgcagt gcggttccat cagcgatgca gaacacgcta caaaatgtgc 6360

tcattgccgc aactaaaaga aattgcaacg tcacgcagat gcgtgaactg ccaacactgg 6420

actcagcgac attcaatgtc gaatgctttc gaaaatatgc atgtaatgac gagtattggg 6480 

aggagttcgc tcggaagcca attaggatta ccactgagtt tgtcaccgca tatgtagcta 6540 

gactgaaagg ccctaaggcc gccgcactat ttgcaaagac gtataatttg gtcccattgc 6600 

aagaagtgcc tatggataga ttcgtcatgg acatgaaaag agacgtgaaa gttacaccag 6660 
gcacgaaaca cacagaagaa agaccgaaag tacaagtgat acaagccgca gaacccctgg 6720

cgactgctta cttatgcggg attcaccggg aattagtgcg taggcttacg gccgtcttgc 6780 

ttccaaacat tcacacgctt tttgacatgt cggcggagga ttttgatgca atcatagcag 6840 

aacacttcaa gcaaggcgac ccggtactgg agacggatat cgcatcattc gacaaaagcc 6900 

aagacgacgc tatggcgtta accggtctga tgatcttgga ggacctgggt gtggatcaac 6960 

cactactcga cttgatcgag tgcgcctttg gagaaatatc atccacccat ctacctacgg 7020 

gtactcgttt taaattcggg gcgatgatga aatccggaat gttcctcaca ctttttgtca 7080 

acacagtttt gaatgtcgtt atcgccagca gagtactaga agagcggctt aaaacgtcca 7140 

gatgtgcagc gttcattggc gacgacaaca tcatacatgg agtagtatct gacaaagaaa 7200 

tggctgagag gtgcgccacc tggctcaaca tggaggttaa gatcatcgac gcagtcatcg 7260 

gtgagagacc accttacttc tgcggcggat ttatcttgca agattcggtt acttccacag 7320 

cgtgccgcgt ggcggatccc ctgaaaaggc tgtttaagtt gggtaaaccg ctcccagccg 7380

acgacgagca agacgaagac agaagacgcg ctctgctaga tgaaacaaag gcgtggttta 7440 

gagtaggtat aacaggcact ttagcagtgg ccgtgacgac ccggtatgag gtagacaata 7500 

ttacacctgt cctactggca ttgagaactt ttgcccagag caaaagagca ttccaagcca 7560 

tcagagggga aataaagcat ctctacggtg gtcctaaata gtcagcatag tacatttcat 7620 

ctgactaata ctacaacacc accaccatga atagaggatt ctttaacatg ctcggccgcc 7680 

gccccttccc ggcccccact gccatgtgga ggccgcggag aaggaggcag gcggccccga 7740 

tgcctgcccg caacgggctg gcttctcaaa tccagcaact gaccacagcc gtcagtgccc 7800 

tagtcattgg acaggcaact agacctcaac ccccacgtcc acgcccgcca ccgcgccaga 7860 

agaagcaggc gcccaagcaa ccaccgaagc cgaagaaacc aaaaacgcag gagaagaaga 7920 

agaagcaacc tgcaaaaccc aaacccggaa agagacagcg catggcactt aagttggagg 7980 

ccgacagatt gttcgacgtc aagaacgagg acggagatgt catcgggcac gcactggcca 8040 

tggaaggaaa ggtaatgaaa cctctgcacg tgaaaggaac catcgaccac cctgtgctat 8100 

caaagctcaa atttaccaag tcgtcagcat acgacatgga gttcgcacag ttgccagtca 8160 

acatgagaag tgaggcattc acctacacca gtgaacaccc cgaaggattc tataactggc 8220 

accacggagc ggtgcagtat agtggaggta gatttaccat ccctcgcgga gtaggaggca 8280 

gaggagacag cggtcgtccg atcatggata actccggtcg ggttgtcgcg atagtcctcg 8340 

gtggcgctga tgaaggaaca cgaactgccc tttcggtcgt cacctggaat agtaaaggga 8400 
agacaattaa gacgaccccg gaagggacag aagagtgg tcc gca gca cca ctg gtc 8456
Ser Ala Ala Pro Leu Val
1 5

acg gca atg tgt ttg ctc gga aat gtg agc ttc cca tgc gac cgc ccg 8504
Thr Ala Met Cys Leu Leu Gly Asn Val Ser Phe Pro Cys Asp Arg Pro
10 15 20

ccc aca tgc tat acc cgc gaa cct tcc aga gcc ctc gac atc ctt gaa 8552
Pro Thr Cys Tyr Thr Arg Glu Pro Ser Arg Ala Leu Asp Ile Leu Glu
25 30 35

gag aac gtg aac cat gag gcc tac gat acc ctg ctc aat gcc ata ttg 8600
Glu Asn Val Asn His Glu Ala Tyr Asp Thr Leu Leu Asn Ala Ile Leu
40 45 50

cgg tgc gga tcg tct ggc aga agc aaa aga agc gtc att gac gac ttt 8648
Arg Cys Gly Ser Ser Gly Arg Ser Lys Arg Ser Val Ile Asp Asp Phe
55 60 65 70

acc ctg acc agc ccc tac ttg ggc aca tgc tcg tac tgc cac cat act 8696
Thr Leu Thr Ser Pro Tyr Leu Gly Thr Cys Ser Tyr Cys His His Thr
75 80 85

gta ccg tgc ttc agc cct gtt aag atc gag cag gtc tgg gac gaa gcg 8744
Val Pro Cys Phe Ser Pro Val Lys Ile Glu Gln Val Trp Asp Glu Ala
90 95 100

gac gat aac acc ata cgc ata cag act tcc gcc cag ttt gga tac gac 8792
Asp Asp Asn Thr Ile Arg Ile Gln Thr Ser Ala Gln Phe Gly Tyr Asp
105 110 115

caa agc gga gca gca agc gca aac aag tac cgc tac atg tcg ctt aag 8840
Gln Ser Gly Ala Ala Ser Ala Asn Lys Tyr Arg Tyr Met Ser Leu Lys
120 125 130

cag gat cac acc gtt aaa gaa ggc acc atg gat gac atc aag att agc 8888
Gln Asp His Thr Val Lys Glu Gly Thr Met Asp Asp Ile Lys Ile Ser
135 140 145 150

acc tca gga ccg tgt aga agg ctt agc tac aaa gga tac ttt ctc ctc 8936
Thr Ser Gly Pro Cys Arg Arg Leu Ser Tyr Lys Gly Tyr Phe Leu Leu
155 160 165

gca aaa tgc cct cca ggg gac agc gta acg gtt agc ata gtg agt agc 8984
Ala Lys Cys Pro Pro Gly Asp Ser Val Thr Val Ser Ile Val Ser Ser
170 175 180

aac tca gca acg tca tgt aca ctg gcc cgc aag ata aaa cca aaa ttc 9032
Asn Ser Ala Thr Ser Cys Thr Leu Ala Arg Lys Ile Lys Pro Lys Phe
185 190 195

gtg gga cgg gaa aaa tat gat cta cct ccc gtt cac ggt aaa aaa att 9080
Val Gly Arg Glu Lys Tyr Asp Leu Pro Pro Val His Gly Lys Lys Ile
200 205 210
cct tgc aca gtg tac gac cgt ctg aaa gaa aca act gca ggc tac atc 9128
Pro Cys Thr Val Tyr Asp Arg Leu Lys Glu Thr Thr Ala Gly Tyr Ile
215 220 225 230

act atg cac agg ccg aga ccg cac gct tat aca tcc tac ctg gaa gaa 9176
Thr Met His Arg Pro Arg Pro His Ala Tyr Thr Ser Tyr Leu Glu Glu
235 240 245

tca tca ggg aaa gtt tac gca aag ccg cca tct ggg aag aac att acg 9224
Ser Ser Gly Lys Val Tyr Ala Lys Pro Pro Ser Gly Lys Asn Ile Thr
250 255 260

tat gag tgc aag tgc ggc gac tac aag acc gga acc gtt tcg acc cgc 9272
Tyr Glu Cys Lys Cys Gly Asp Tyr Lys Thr Gly Thr Val Ser Thr Arg
265 270 275

acc gaa atc act ggt tgc acc gcc atc aag cag tgc gtc gcc tat aag 9320
Thr Glu Ile Thr Gly Cys Thr Ala Ile Lys Gln Cys Val Ala Tyr Lys
280 285 290

agc gac caa acg aag tgg gtc ttc aac tca ccg gac ttg atc aga cat 9368
Ser Asp Gln Thr Lys Trp Val Phe Asn Ser Pro Asp Leu Ile Arg His
295 300 305 310

gac gac cac acg gcc caa ggg aaa ttg cat ttg cct ttc aag ttg atc 9416
Asp Asp His Thr Ala Gln Gly Lys Leu His Leu Pro Phe Lys Leu Ile
315 320 325

ccg agt acc tgc atg gtc cct gtt gcc cac gcg ccg aat gta ata cat 9464
Pro Ser Thr Cys Met Val Pro Val Ala His Ala Pro Asn Val Ile His
330 335 340

ggc ttt aaa cac atc agc ctc caa tta gat aca gac cac ttg aca ttg 9512
Gly Phe Lys His Ile Ser Leu Gln Leu Asp Thr Asp His Leu Thr Leu
345 350 355

ctc acc acc agg aga cta ggg gca aac ccg gaa cca acc act gaa tgg 9560
Leu Thr Thr Arg Arg Leu Gly Ala Asn Pro Glu Pro Thr Thr Glu Trp
360 365 370

atc gtc gga aag acg gtc aga aac ttc acc gtc gac cga gat ggc ctg 9608
Ile Val Gly Lys Thr Val Arg Asn Phe Thr Val Asp Arg Asp Gly Leu
375 380 385 390

gaa tac ata tgg gga aat cat gag cca gtg agg gtc tat gcc caa gag 9656
Glu Tyr Ile Trp Gly Asn His Glu Pro Val Arg Val Tyr Ala Gln Glu
395 400 405

tca gca cca gga gac cct cac gga tgg cca cac gaa ata gta cag cat 9704
Ser Ala Pro Gly Asp Pro His Gly Trp Pro His Glu Ile Val Gln His
410 415 420

tac tac cat cgc cat cct gtg tac acc atc tta gcc gtc gca tca gct 9752
Tyr Tyr His Arg His Pro Val Tyr Thr Ile Leu Ala Val Ala Ser Ala
425 430 435

acc gtg gcg atg atg att ggc gta act gtt gca gtg tta tgt gcc tgt 9800
Thr Val Ala Met Met Ile Gly Val Thr Val Ala Val Leu Cys Ala Cys
440 445 450

aaa gcg cgc cgt gag tgc ctg acg cca tac gcc ctg gcc cca aac gcc 9848
Lys Ala Arg Arg Glu Cys Leu Thr Pro Tyr Ala Leu Ala Pro Asn Ala
455 460 465 470

gta atc cca act tcg ctg gca ctc ttg tgc tgc gtt agg tcg gcc aat 9896
Val Ile Pro Thr Ser Leu Ala Leu Leu Cys Cys Val Arg Ser Ala Asn
475 480 485

gct gaaacgttca ccgagaccat gagttacttg tggtcgaaca gtcagccgtt 9949
Ala



cttctgggtc cagttgtgca tacctttggc cgctttcatc gttctaatgc gctgccgctc 10009

ctgctgcctg ccttttttag tggttgccgg cgcctacctg gcgaaggtag acgcctacga 10069

acatgcgacc actgttccaa atgtgccaca gataccgtat aaggcacttg ttgaaagggc 10129

agggtatgcc ccgctcaatt tggagatcac tgtcatgtcc tcggaggttt tgccttccac 10189

caaccaagag tacattacct gcaaattcac cactgtggtc ccctccccaa aaatcaaatg 10249

ctgcggctcc ttggaatgtc agccggccgc tcatgcagac tatacctgca aggtcctcgg 10309

aggggtctac ccctttatgt ggggaggagc gcaatgtttt tgcgacagtg agaacagcca 10369

gatgagtgag gcgtacgtcg aattgtcagc agattgcgcg tctgaccacg cgcaggcgat 10429

taaggtgcac actgccgcga tgaaagtagg actgcgtatt gtgtacggga acaccaccag 10489

tttcctagat gtgtacgtga acggagtcac accaggaacg tctaaagact tgaaagtcac 10549

agctggacca atttcagcat cgtttacgcc attcgatcat aaggtcgtta tccatcgcgg 10609

cctggtgtac aactatgact tcccggaata tggagcgatg aaaccaggag cgtttggaga 10669

cattcaagct acctccttga ctagcaagga tctcatcgcc agcacagaca ttaggctact 10729

caagccttcc gccaagaacg tgcatgtccc gtacacgcag gcctcatcag gatttgagat 10789

gtggaaaaac aactcaggcc gcccactgca ggaaaccgca cctttcgggt gtaagattgc 10849

agtaaatccg ctccgagcgg tggactgttc atacgggaac attcccattt ctattgacat 10909

cccgaacgct gcctttatca ggacatcaga tgcaccactg gtctcaacag tcaaatgtga 10969

agtcagtgag tgcacttatt cagcagactt cggcgggatg gccaccctgc agtatgtatc 11029

cgaccgcgaa ggtcaatgcc ccgtacattc gcattcgagc acagcaactc tccaagagtc 11089

gacagtacat gtcctggaga aaggagcggt gacagtacac tttagcaccg cgagtccaca 11149

ggcgaacttt atcgtatcgc tgtgtgggaa gaagacaaca tgcaatgcag aatgtaaacc 11209






accagctgac catatcgtga gcaccccgca caaaaatgac caagaatttc aagccgccat 11269 

ctcaaaaaca tcatggagtt ggctgtttgc ccttttcggc ggcgcctcgt cgctattaat 11329 

tataggactt atgatttttg cttgcagcat gatgctgact agcacacgaa gatgaccgct 11389

acgccccaat gatccgacca gcaaaactcg atgtacttcc gaggaactga tgtgcataat 11449 

gcatcaggct ggtacattag atccccgctt accgcgggca atatagcaac actaaaaact 11509 

cgatgtactt ccgaggaagc gcagtgcata atgctgcgca gtgttgccac ataaccacta 11569 

tattaaccat ttatctagcg gacgccaaaa actcaatgta tttctgagga agcgtggtgc 11629 

ataatgccac gcagcgtctg cataactttt attatttctt ttattaatca acaaaatttt 11689 

gtttttaaca tttc 11703 



<210> 16 
<211> 64 
<212> PRT 

<213> Sindbis virus 
<400> 16 

Ser Ala Ala Pro Leu Val Thr Ala Met Cys Leu Leu Gly Asn Val Ser
1 5 10 15

Phe Pro Cys Asp Arg Pro Pro Thr Cys Tyr Thr Arg Glu Pro Ser Arg
20 25 30

Ala Leu Asp Ile Leu Glu Glu Asn Val Asn His Glu Ala Tyr Asp Thr
35 40 45

Leu Leu Asn Ala Ile Leu Arg Cys Gly Ser Ser Gly Arg Ser Lys Arg
50 55 60



<210> 17 

<211> 423 

<212> PRT 

<213> Sindbis virus 

<400> 17 

Ser Val Ile Asp Asp Phe Thr Leu Thr Ser Pro Tyr Leu Gly Thr Cys
1 5 10 15

Ser Tyr Cys His His Thr Val Pro Cys Phe Ser Pro Val Lys Ile Glu
20 25 30

Gln Val Trp Asp Glu Ala Asp Asp Asn Thr Ile Arg Ile Gln Thr Ser
35 40 45

Ala Gln Phe Gly Tyr Asp Gln Ser Gly Ala Ala Ser Ala Asn Lys Tyr
50 55 60

Arg Tyr Met Ser Leu Lys Gln Asp His Thr Val Lys Glu Gly Thr Met
65 70 75 80

Asp Asp Ile Lys Ile Ser Thr Ser Gly Pro Cys Arg Arg Leu Ser Tyr
85 90 95

Lys Gly Tyr Phe Leu Leu Ala Lys Cys Pro Pro Gly Asp Ser Val Thr
100 105 110

Val Ser Ile Val Ser Ser Asn Ser Ala Thr Ser Cys Thr Leu Ala Arg
115 120 125

Lys Ile Lys Pro Lys Phe Val Gly Arg Glu Lys Tyr Asp Leu Pro Pro
130 135 140

Val His Gly Lys Lys Ile Pro Cys Thr Val Tyr Asp Arg Leu Lys Glu
145 150 155 160

Thr Thr Ala Gly Tyr Ile Thr Met His Arg Pro Arg Pro His Ala Tyr
165 170 175

Thr Ser Tyr Leu Glu Glu Ser Ser Gly Lys Val Tyr Ala Lys Pro Pro
180 185 190

Ser Gly Lys Asn Ile Thr Tyr Glu Cys Lys Cys Gly Asp Tyr Lys Thr
195 200 205

Gly Thr Val Ser Thr Arg Thr Glu Ile Thr Gly Cys Thr Ala Ile Lys
210 215 220

Gln Cys Val Ala Tyr Lys Ser Asp Gln Thr Lys Trp Val Phe Asn Ser
225 230 235 240

Pro Asp Leu Ile Arg His Asp Asp His Thr Ala Gln Gly Lys Leu His
245 250 255

Leu Pro Phe Lys Leu Ile Pro Ser Thr Cys Met Val Pro Val Ala His
260 265 270



Ala Pro Asn Val Ile His Gly Phe Lys His Ile Ser Leu Gln Leu Asp
275 280 285

Thr Asp His Leu Thr Leu Leu Thr Thr Arg Arg Leu Gly Ala Asn Pro
290 295 300

Glu Pro Thr Thr Glu Trp Ile Val Gly Lys Thr Val Arg Asn Phe Thr
305 310 315 320

Val Asp Arg Asp Gly Leu Glu Tyr Ile Trp Gly Asn His Glu Pro Val
325 330 335

Arg Val Tyr Ala Gln Glu Ser Ala Pro Gly Asp Pro His Gly Trp Pro
340 345 350

His Glu Ile Val Gln His Tyr Tyr His Arg His Pro Val Tyr Thr Ile
355 360 365

Leu Ala Val Ala Ser Ala Thr Val Ala Met Met Ile Gly Val Thr Val
370 375 380

Ala Val Leu Cys Ala Cys Lys Ala Arg Arg Glu Cys Leu Thr Pro Tyr
385 390 395 400

Ala Leu Ala Pro Asn Ala Val Ile Pro Thr Ser Leu Ala Leu Leu Cys
405 410 415

Cys Val Arg Ser Ala Asn Ala
420



<210> 18 

<211> 51 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Linker 



<220> 

<221> CDS 

<222> (1)..(51)

<400> 18 
aga tct ggt ggc ggt ggc tcg ggc ggt ggt ggg tcg ggt ggc ggc gga
Arg Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
1 5 10 15

tct
Ser



<210> 19 
<211> 17 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic Construct 
<400> 19 

Arg Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 
1 5 10 15



Ser 



<210> 20 

<211> 45 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Linker 



<220> 

<221> CDS 
<222> (1)..(45)

<400> 20 

ggt ggc ggt ggc tcg ggc ggt ggt ggg tcg ggt ggc ggc gga tct
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1 5 10 15



<210> 21
<211> 15
<212> PRT

<213> Artificial sequence
<220>

<223> Synthetic Construct
<400> 21

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1 5 10 15



<210> 22 

<211> 1050 

<212> DNA 

<213> Bluetongue virus 10 



<220> 

<221> CDS 

<222> (1)..(1050) 

<223> VP7 gene 

<400> 22 

atg gac act atc gcc gca aga gca ctc act gtg atg cga gca tgt gct
Met Asp Thr Ile Ala Ala Arg Ala Leu Thr Val Met Arg Ala Cys Ala
1 5 10 15

acg ctt caa gag gca aga att gtg ttg gaa gcc aat gtg atg gaa att
Thr Leu Gln Glu Ala Arg Ile Val Leu Glu Ala Asn Val Met Glu Ile
20 25 30

ttg ggg ata gct atc aat agg tac aat gga ctc act tta cga gga gtg
Leu Gly Ile Ala Ile Asn Arg Tyr Asn Gly Leu Thr Leu Arg Gly Val
35 40 45

acg atg cgc ccg acc tcg tta gca caa aga aat gag atg ttt ttt atg
Thr Met Arg Pro Thr Ser Leu Ala Gln Arg Asn Glu Met Phe Phe Met
50 55 60

tgt ttg gat atg atg ctg tct gct gct ggg ata aat gtt gga ccg ata
Cys Leu Asp Met Met Leu Ser Ala Ala Gly Ile Asn Val Gly Pro Ile
65 70 75 80

tcg cca gac tat act caa cat atg gct acg att ggt gta cta gca aca
Ser Pro Asp Tyr Thr Gln His Met Ala Thr Ile Gly Val Leu Ala Thr
85 90 95

ccg gaa ata cct ttt aca acg gaa gcg gcg aat gaa ata gca cga gtg
Pro Glu Ile Pro Phe Thr Thr Glu Ala Ala Asn Glu Ile Ala Arg Val
100 105 110

act ggg gag act tcg aca tgg ggg cca gcg cgt cag cct tat ggt ttc
Thr Gly Glu Thr Ser Thr Trp Gly Pro Ala Arg Gln Pro Tyr Gly Phe
115 120 125

ttc ctt gaa act gag gaa acc ttc caa cca ggg agg tgg ttc atg cgc
Phe Leu Glu Thr Glu Glu Thr Phe Gln Pro Gly Arg Trp Phe Met Arg
130 135 140

gcc gct caa gca gta act gca gta gtg tgc ggt ccg gat atg att caa
Ala Ala Gln Ala Val Thr Ala Val Val Cys Gly Pro Asp Met Ile Gln
145 150 155 160

gtg tca ctt aat gct gga gcg aga gga gat gta caa cag ata ttt cag
Val Ser Leu Asn Ala Gly Ala Arg Gly Asp Val Gln Gln Ile Phe Gln
165 170 175



ggt cgt aat gat ccc atg atg ata tat tta gtg tgg agg aga atc gaa 576
Gly Arg Asn Asp Pro Met Met Ile Tyr Leu Val Trp Arg Arg Ile Glu
180 185 190

aac ttt gcg atg gcg caa ggt aat tca cag caa act caa gcg ggt gtg 624
Asn Phe Ala Met Ala Gln Gly Asn Ser Gln Gln Thr Gln Ala Gly Val
195 200 205

act gtc agt gtt ggt gga gtt gac atg agg gcg gga cgc att ata gcg 672
Thr Val Ser Val Gly Gly Val Asp Met Arg Ala Gly Arg Ile Ile Ala
210 215 220

tgg gat gga cag gcc gcg ctg cat gtg cat aat ccg aca caa cag aat 720
Trp Asp Gly Gln Ala Ala Leu His Val His Asn Pro Thr Gln Gln Asn
225 230 235 240

gcg atg gtg caa ata cag gtt gtg ttc tat ata tct atg gat aaa act 768
Ala Met Val Gln Ile Gln Val Val Phe Tyr Ile Ser Met Asp Lys Thr
245 250 255

tta aac cag tac ccc gct ttg act gct gag att ttc aat gtt tac agc 816
Leu Asn Gln Tyr Pro Ala Leu Thr Ala Glu Ile Phe Asn Val Tyr Ser
260 265 270

ttc agg gac cac aca tgg cat ggg cta aga acg gcg ata tta aac aga 864
Phe Arg Asp His Thr Trp His Gly Leu Arg Thr Ala Ile Leu Asn Arg
275 280 285

acc aca ctg cca aac atg ctg cca cca atc ttc cca cca aat gat cga 912
Thr Thr Leu Pro Asn Met Leu Pro Pro Ile Phe Pro Pro Asn Asp Arg
290 295 300

gat agc atc tta act ctt cta ctt tta tct aca ctt gct gat gtt tac 960
Asp Ser Ile Leu Thr Leu Leu Leu Leu Ser Thr Leu Ala Asp Val Tyr
305 310 315 320

act gtt tta agg cca gag ttt gcg att cac ggc gta aat ccg atg cca 1008
Thr Val Leu Arg Pro Glu Phe Ala Ile His Gly Val Asn Pro Met Pro
325 330 335

ggg ccg ctc aca cgt gct att gcg cgc gcc gcc tat gtg tag 1050
Gly Pro Leu Thr Arg Ala Ile Ala Arg Ala Ala Tyr Val
340 345



<210> 23 

<211> 349 

<212> PRT 

<213> Bluetongue virus 10 

<400> 23 

Met Asp Thr Ile Ala Ala Arg Ala Leu Thr Val Met Arg Ala Cys Ala
1 5 10 15

Thr Leu Gln Glu Ala Arg Ile Val Leu Glu Ala Asn Val Met Glu Ile
20 25 30

Leu Gly Ile Ala Ile Asn Arg Tyr Asn Gly Leu Thr Leu Arg Gly Val
35 40 45

Thr Met Arg Pro Thr Ser Leu Ala Gln Arg Asn Glu Met Phe Phe Met
50 55 60

Cys Leu Asp Met Met Leu Ser Ala Ala Gly Ile Asn Val Gly Pro Ile
65 70 75 80

Ser Pro Asp Tyr Thr Gln His Met Ala Thr Ile Gly Val Leu Ala Thr
85 90 95

Pro Glu Ile Pro Phe Thr Thr Glu Ala Ala Asn Glu Ile Ala Arg Val
100 105 110

Thr Gly Glu Thr Ser Thr Trp Gly Pro Ala Arg Gln Pro Tyr Gly Phe
115 120 125

Phe Leu Glu Thr Glu Glu Thr Phe Gln Pro Gly Arg Trp Phe Met Arg
130 135 140

Ala Ala Gln Ala Val Thr Ala Val Val Cys Gly Pro Asp Met Ile Gln
145 150 155 160

Val Ser Leu Asn Ala Gly Ala Arg Gly Asp Val Gln Gln Ile Phe Gln
165 170 175

Gly Arg Asn Asp Pro Met Met Ile Tyr Leu Val Trp Arg Arg Ile Glu
180 185 190

Asn Phe Ala Met Ala Gln Gly Asn Ser Gln Gln Thr Gln Ala Gly Val
195 200 205

Thr Val Ser Val Gly Gly Val Asp Met Arg Ala Gly Arg Ile Ile Ala
210 215 220

Trp Asp Gly Gln Ala Ala Leu His Val His Asn Pro Thr Gln Gln Asn
225 230 235 240

Ala Met Val Gln Ile Gln Val Val Phe Tyr Ile Ser Met Asp Lys Thr
245 250 255

Leu Asn Gln Tyr Pro Ala Leu Thr Ala Glu Ile Phe Asn Val Tyr Ser
260 265 270

Phe Arg Asp His Thr Trp His Gly Leu Arg Thr Ala Ile Leu Asn Arg
275 280 285

Thr Thr Leu Pro Asn Met Leu Pro Pro Ile Phe Pro Pro Asn Asp Arg
290 295 300

Asp Ser Ile Leu Thr Leu Leu Leu Leu Ser Thr Leu Ala Asp Val Tyr
305 310 315 320

Thr Val Leu Arg Pro Glu Phe Ala Ile His Gly Val Asn Pro Met Pro
325 330 335

Gly Pro Leu Thr Arg Ala Ile Ala Arg Ala Ala Tyr Val
340 345



<210> 24 

<211> 585 

<212> DNA 

<213> Coccidioides immitis 



<220> 

<221> CDS 

<222> (1)..(585)

<223> Ag2 / PRA gene 

<400> 24 

atg cag ttc tct cac gct ctc atc gct ctc gtc gct gcc ggc ctc gcc 48
Met Gln Phe Ser His Ala Leu Ile Ala Leu Val Ala Ala Gly Leu Ala
1 5 10 15

agt gcc cag ctc cca gac atc cca cct tgc gct ctc aac tgc ttc gtt 96
Ser Ala Gln Leu Pro Asp Ile Pro Pro Cys Ala Leu Asn Cys Phe Val
20 25 30

gag gct ctc ggc aac gat ggc tgc act cgc ttg acc gac ttc aag tgc 144
Glu Ala Leu Gly Asn Asp Gly Cys Thr Arg Leu Thr Asp Phe Lys Cys
35 40 45

cac tgc tcc aag cct gag cta cca gga cag atc act cct tgc gtt gag 192
His Cys Ser Lys Pro Glu Leu Pro Gly Gln Ile Thr Pro Cys Val Glu
50 55 60

gag gcc tgc cct ctc gac gcc cgt atc tcc gtc tcc aac atc gtc gtt 240
Glu Ala Cys Pro Leu Asp Ala Arg Ile Ser Val Ser Asn Ile Val Val
65 70 75 80

gac cag tgc tcc aag gcc ggt gtc cca att gac atc cca cca gtt gac 288
Asp Gln Cys Ser Lys Ala Gly Val Pro Ile Asp Ile Pro Pro Val Asp
85 90 95

acc acc gcc gct ccc gag cca tcc gag acc gct gag ccc acc gct gag 336
Thr Thr Ala Ala Pro Glu Pro Ser Glu Thr Ala Glu Pro Thr Ala Glu
100 105 110

cca acc gag gag ccc act gcc gag cct acc gct gag ccc acc gct gag 384
Pro Thr Glu Glu Pro Thr Ala Glu Pro Thr Ala Glu Pro Thr Ala Glu
115 120 125

ccg act cat gag ccc acc gag gag ccc act gcc gtc cca acc ggc act 432
Pro Thr His Glu Pro Thr Glu Glu Pro Thr Ala Val Pro Thr Gly Thr
130 135 140

ggc ggt ggt gtc ccc act ggc acc ggt tcc ttc acc gtc act ggc aga 480
Gly Gly Gly Val Pro Thr Gly Thr Gly Ser Phe Thr Val Thr Gly Arg
145 150 155 160

cca act gcc tcc acc cca gct gag ttc cca ggt gct ggc tcc aac gtc 528
Pro Thr Ala Ser Thr Pro Ala Glu Phe Pro Gly Ala Gly Ser Asn Val
165 170 175

cgt gcc agc gtt ggc ggc att gct gct gct ctc ctc ggt ctc gct gcc 576
Arg Ala Ser Val Gly Gly Ile Ala Ala Ala Leu Leu Gly Leu Ala Ala
180 185 190

tac ctg taa 585
Tyr Leu



<210> 25
<211> 194
<212> PRT

<213> Coccidioides immitis
<400> 25

Met Gln Phe Ser His Ala Leu Ile Ala Leu Val Ala Ala Gly Leu Ala
1 5 10 15

Ser Ala Gln Leu Pro Asp Ile Pro Pro Cys Ala Leu Asn Cys Phe Val
20 25 30

Glu Ala Leu Gly Asn Asp Gly Cys Thr Arg Leu Thr Asp Phe Lys Cys
35 40 45

His Cys Ser Lys Pro Glu Leu Pro Gly Gln Ile Thr Pro Cys Val Glu
50 55 60



Glu Ala Cys Pro Leu Asp Ala Arg Ile Ser Val Ser Asn Ile Val Val
65 70 75 80

Asp Gln Cys Ser Lys Ala Gly Val Pro Ile Asp Ile Pro Pro Val Asp
85 90 95

Thr Thr Ala Ala Pro Glu Pro Ser Glu Thr Ala Glu Pro Thr Ala Glu
100 105 110

Pro Thr Glu Glu Pro Thr Ala Glu Pro Thr Ala Glu Pro Thr Ala Glu
115 120 125

Pro Thr His Glu Pro Thr Glu Glu Pro Thr Ala Val Pro Thr Gly Thr
130 135 140

Gly Gly Gly Val Pro Thr Gly Thr Gly Ser Phe Thr Val Thr Gly Arg
145 150 155 160

Pro Thr Ala Ser Thr Pro Ala Glu Phe Pro Gly Ala Gly Ser Asn Val
165 170 175

Arg Ala Ser Val Gly Gly Ile Ala Ala Ala Leu Leu Gly Leu Ala Ala
180 185 190

Tyr Leu



<210> 26 

<211> 906 

<212> DNA 

<213> Streptococcus pneumoniae 



<220> 

<221> CDS 

<222> (1)..(906)

<223> PspA gene 

<400> 26 

gaa gaa tct ccc gta gcc agt cag tct aaa gct gag aaa gac tat gat 48
Glu Glu Ser Pro Val Ala Ser Gln Ser Lys Ala Glu Lys Asp Tyr Asp
1 5 10 15

gca gcg aag aaa gat gct aag aat gcg aaa aaa gca gta gaa gat gct 96
Ala Ala Lys Lys Asp Ala Lys Asn Ala Lys Lys Ala Val Glu Asp Ala
20 25 30

caa aag gct tta gat gat gca aaa gct gct cag aaa aaa tat gac gag 144
Gln Lys Ala Leu Asp Asp Ala Lys Ala Ala Gln Lys Lys Tyr Asp Glu
35 40 45

gat cag aag aaa act gag gag aaa gcc gcg cta gaa aaa gca gcg tct 192
Asp Gln Lys Lys Thr Glu Glu Lys Ala Ala Leu Glu Lys Ala Ala Ser
50 55 60

gaa gag atg gat aag gca gtg gca gca gtt caa caa gcg tat cta gcc 240
Glu Glu Met Asp Lys Ala Val Ala Ala Val Gln Gln Ala Tyr Leu Ala
65 70 75 80

tat caa caa gct aca gac aaa gcc gca aaa gac gca gca gat aag atg 288
Tyr Gln Gln Ala Thr Asp Lys Ala Ala Lys Asp Ala Ala Asp Lys Met
85 90 95

ata gat gaa gct aag aaa cgc gaa gaa gag gca aaa act aaa ttt aat 336
Ile Asp Glu Ala Lys Lys Arg Glu Glu Glu Ala Lys Thr Lys Phe Asn
100 105 110

act gtt cga gca atg gta gtt cct gag cca gag cag ttg gct gag act 384
Thr Val Arg Ala Met Val Val Pro Glu Pro Glu Gln Leu Ala Glu Thr
115 120 125

aag aaa aaa tca gaa gaa gct aaa caa aaa gca cca gaa ctt act aaa 432
Lys Lys Lys Ser Glu Glu Ala Lys Gln Lys Ala Pro Glu Leu Thr Lys
130 135 140

aaa cta gaa gaa gct aaa gca aaa tta gaa gag gct gag aaa aaa gct 480
Lys Leu Glu Glu Ala Lys Ala Lys Leu Glu Glu Ala Glu Lys Lys Ala
145 150 155 160

act gaa gcc aaa caa aaa gtg gat gct gaa gaa gtc gct cct caa gct 528
Thr Glu Ala Lys Gln Lys Val Asp Ala Glu Glu Val Ala Pro Gln Ala
165 170 175

aaa atc gct gaa ttg gaa aat caa gtt cat aga cta gaa caa gag ctc 576
Lys Ile Ala Glu Leu Glu Asn Gln Val His Arg Leu Glu Gln Glu Leu
180 185 190

aaa gag att gat gag tct gaa tca gaa gat tat gct aaa gaa ggt ttc 624
Lys Glu Ile Asp Glu Ser Glu Ser Glu Asp Tyr Ala Lys Glu Gly Phe
195 200 205

cgt gct cct ctt caa tct aaa ttg gat gcc aaa aaa gct aaa cta tca 672
Arg Ala Pro Leu Gln Ser Lys Leu Asp Ala Lys Lys Ala Lys Leu Ser
210 215 220

aaa ctt gaa gag tta agt gat aag att gat gag tta gac gct gaa att 720
Lys Leu Glu Glu Leu Ser Asp Lys Ile Asp Glu Leu Asp Ala Glu Ile
225 230 235 240

gca aaa ctt gaa gat caa ctt aaa gct gct gaa gaa aac aat aat gta 768
Ala Lys Leu Glu Asp Gln Leu Lys Ala Ala Glu Glu Asn Asn Asn Val
245 250 255

gaa gac tac ttt aaa gaa ggt tta gag aaa act att gct gct aaa aaa 816
Glu Asp Tyr Phe Lys Glu Gly Leu Glu Lys Thr Ile Ala Ala Lys Lys
260 265 270

gct gaa tta gaa aaa act gaa gct gac ctt aag aaa gca gtt aat gag 864
Ala Glu Leu Glu Lys Thr Glu Ala Asp Leu Lys Lys Ala Val Asn Glu
275 280 285

cca gaa aaa cca gct cca gct cca gaa act cca gcc cca gaa 906
Pro Glu Lys Pro Ala Pro Ala Pro Glu Thr Pro Ala Pro Glu
290 295 300



<210> 27
<211> 302
<212> PRT

<213> Streptococcus pneumoniae
<400> 27

Glu Glu Ser Pro Val Ala Ser Gln Ser Lys Ala Glu Lys Asp Tyr Asp
1 5 10 15

Ala Ala Lys Lys Asp Ala Lys Asn Ala Lys Lys Ala Val Glu Asp Ala
20 25 30

Gln Lys Ala Leu Asp Asp Ala Lys Ala Ala Gln Lys Lys Tyr Asp Glu
35 40 45

Asp Gln Lys Lys Thr Glu Glu Lys Ala Ala Leu Glu Lys Ala Ala Ser
50 55 60

Glu Glu Met Asp Lys Ala Val Ala Ala Val Gln Gln Ala Tyr Leu Ala
65 70 75 80

Tyr Gln Gln Ala Thr Asp Lys Ala Ala Lys Asp Ala Ala Asp Lys Met
85 90 95

Ile Asp Glu Ala Lys Lys Arg Glu Glu Glu Ala Lys Thr Lys Phe Asn
100 105 110

Thr Val Arg Ala Met Val Val Pro Glu Pro Glu Gln Leu Ala Glu Thr
115 120 125

Lys Lys Lys Ser Glu Glu Ala Lys Gln Lys Ala Pro Glu Leu Thr Lys
130 135 140

Lys Leu Glu Glu Ala Lys Ala Lys Leu Glu Glu Ala Glu Lys Lys Ala
145 150 155 160



Thr Glu Ala Lys Gln Lys Val Asp Ala Glu Glu Val Ala Pro Gln Ala
165 170 175

Lys Ile Ala Glu Leu Glu Asn Gln Val His Arg Leu Glu Gln Glu Leu
180 185 190

Lys Glu Ile Asp Glu Ser Glu Ser Glu Asp Tyr Ala Lys Glu Gly Phe
195 200 205

Arg Ala Pro Leu Gln Ser Lys Leu Asp Ala Lys Lys Ala Lys Leu Ser
210 215 220

Lys Leu Glu Glu Leu Ser Asp Lys Ile Asp Glu Leu Asp Ala Glu Ile
225 230 235 240

Ala Lys Leu Glu Asp Gln Leu Lys Ala Ala Glu Glu Asn Asn Asn Val
245 250 255

Glu Asp Tyr Phe Lys Glu Gly Leu Glu Lys Thr Ile Ala Ala Lys Lys
260 265 270

Ala Glu Leu Glu Lys Thr Glu Ala Asp Leu Lys Lys Ala Val Asn Glu
275 280 285

Pro Glu Lys Pro Ala Pro Ala Pro Glu Thr Pro Ala Pro Glu
290 295 300






